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Abstract 

U ■ This paper characterizes differentiable and subgame Markov perfect equi- 

. hbria in a continuous time intertemporal decision problem with non-constant 

fH I discounting. Capturing the idea of non commitment by letting the commitment 

period being infinitesimally small, we characterize the equilibrium strategies 
by a value function, which must satisfy a certain equation. The equilibrium 
equation is reminiscent of the classical Hamilton-Jacobi-Bellman equation of 
optimal control, but with a non-local term leading to differences in qualitative 
^ ' behavior. As an application, we formulate an overlapping generations Ramsey 

, model where the government maximizes a utilitarian welfare function defined 

as the discounted sum of successive generations' lifetime utilities. When the 
ff^ ■ social discount rate is different from the private discount rate, the optimal 

. command allocation is time inconsistent and we retain subgame perfection as 

I a principle of intergenerational equity. Existence of multiple subgame perfect 

equilibria is established. The multiplicity is due to the successive governments' 
inability to coordinate their beliefs and we single out one of them as (locally) 
renegotiation-proof. Decentralization can be achieved with both age and time 
^ ' dependent lump sum transfers and, long term distorting capital interest income 

■ taxes/subsidy. 
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1 Introduction 



Time inconsistency is present in many dynamic decision making problem^j. The pri- 
mary objective of this paper is to develop a new approach to analyze a class of time 
inconsistent games where time inconsistency is due to non constant discount rates. 
Using this methodology, we characterize the equilibria of the game and, for a partic- 
ular specification of the discount function, we establish their existence and report a 
new type of indeterminacy for this class of games. The framework is the standard 
deterministic and stationary Ramsey general equilibrium model of growth and capital 
accumulation (see |5l])- Time inconsistency is due to the planner's non-constant dis- 
count rates and there are several good reasons for which this may occur. First, when 
the planner is an individual, there is experimental evidence from psychology (e.g. 
Ainslie [1]) which challenges the assumption of constant discount rates. In particular, 
there is robust evidence that people can indulge in immediate gratification even if 
the delayed cost is high. This suggest a revealed discount rate which is declining 
over time which results in a hyperbolic discount function. Second, if the planner is 
a government, utilitarianism naturally leads to non constant discounting and time 
inconsistency in models with multiple generations. The source of time inconsistency 
is the inability of the forward looking governments to account for the preferences 
of the deceased cohortsjf] The idea is intuitively discussed in the context of natural 
ressources management by Sumaila and Walters [59] and, explained more formally in 
the optimal growth context in a model with altruistic and non-overlapping genera- 
tions (Bernheim [8]) and, in a model with non altruistic and overlapping generations 
(Calvo and Obstfeld [H]). 

Following the previous hterature on the topic, Strotz [58], PoUak [53], Phelps and 
Pollak [52] and, Peleg and Yaari [50], we search for subgame perfect equilibria of a 
dynamic game where decision makers cannot bind their choices (non commitment) 
and are aware of their inconsistency problen|§. We restrict the planner to choose pure 
strategies that are continuously different iable functions of contemporaneous capital 
stock, making the framework as close as possible to the standard optimal growth 
model. The notion of non-commitment is not easily defined in continuous time: there 
is no notion of "successor" because no matter how close in time two planners are, 
there is always a third planner who precedes one of them and succeeds the other. As 

^In particular the issue is of economic relevance in the monetary and fiscal policy design of a 
benevolent government (Kydland and Prescot [42], Calvo [13], Fischer [28] and, Chari and Kehoe 
[l9]). the pricing of a durable good for a monopohst (Stokey [57]), the ownership pohcy for a large 
shareholder (DeMarzo and Urosevic [H]), the long term environmental decision making (Chichilnisky 
[20j and, Li and Lofgren [47]) and, the consumption saving private decision under hyperbolic dis- 
count (Laibson [44]) as well as the Ramsey growth model when the social planner himself exhibits 
hyperbolic discount (Krusell, Kuru§Qu and Smith [40]). 

time inconsistency induced by the structure of the preferences is also present for capital 
accumulation models with non overlapping and altruistic generations when the social planner uses the 
maxi-min Rawlasian criterion (Dasgupta [21j). A similar situation also arises for consumption saving 
problems when the preferences over consumption streams are non additively separable (Kihlstrom 
[37]). 

^See Gul and Pesendorfer f30] for an alternative interpretation of the Strotz's model. 
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a result, the discrete time induction approach cannot be used to solve for the equi- 
librium and we have to adapt it to define the concept of equilibrium in continuous 
time. This is achieved with an intuitive construction: we assume that the planner at 
any point in time can control his immediate successors, thereby forming a coalition 
of a given small size that clearly separates the current planner from the more distant 
ones. A strategy is an equilibrium if it is the best strategy for the current planner 
when the coalition is vanishingly small, so long as the same strategy is expected to 
be used by the distant planners. The idea conceptually parallels Aumann's contin- 
uum of consumers [6] and it simply reduces to the Barro's [7j approach when linear 
equilibria exist0 In the first part of this paper (Sections 2, 3 and 4) we provide a 
precise sense in which the game can be formulated and, a characterization of equilib- 
rium policies. The characterization consists of an instantaneous saving-consumption 
indifference condition together with a non linear differential equation displaying a 
non local term (that is a term which depends on the global behavior of the solution). 
The non local term reflects the strategic motive of investing and, when the discount 
rate is constant it becomes a local term and the equation reduces to the familiar 
Hamilton- Jacobi-Bellman equation (HJB) of optimal control. The equation extends 
the Generalized Euler Equation of Harris and Laibson |.32j concerning the case where 
the planner is facing a countable number of successors to a case where the plan- 
ner is facing a continuum of successors. Notice that an alternative approach could 
have been adopted for our class of continuous time dynamic games. It is possible to 
consider flrst a discrete time game on a time grid, deflne the equilibrium by using 
induction methods on the grid, and consider the equilibrium policy resulting from 
letting the gird be vanishingly flne. This approach has been successfully adopted by 
Luttmer and Mariotti [IH] for linear equilibria of consumption-saving games under 
uncertainty!^ Indeed the linear equilibria of Luttmer and Mariotti [H] are mutually 
consistent with ours in a deterministic version of their model. We did not pursue this 
path here because the approach that we undertook uses simple marginal calculations 
involving standard differential calculus techniques and does not require an extraneous 
convergence theorem. More importantly, it directly leads us to an equilibrium char- 
acterization that mirrors the HJB equation of optimal control which suggests that 
our method is the "adequate" notion of dynamic games in continuous timeO 

What appears to be new in this flrst part of the paper (Sections 2, 3 and 4) rel- 
ative, for instance, to Barro [7| and, Luttmer and Mariotti |:48j is the formal model 
of the game in continuous time and the resulting general characterization of equi- 
librium policies. The framework allows for non linear policies and the equilibrium 
characterization addresses directly the Pollak's criticism of Strotz's work (see section 
4 of PoUak |53j). More importantly, we believe the general methodology that we de- 
velop here can be applied in a broad set of time inconsistent problems including the 
problem of optimal flscal policy. To illustrate the usefulness of the methodology, we 



^See also Phelps [51] for an early discussion on the issue. 

^ Simon and Stinchcombe |56j also used the infinitesimal grid approach to define a more general 
class of games and history dependent strategies in continuous time. 

^ Given the convergence result of Luttmer and Mariotti |48 : for discrete time linear equilibria to 
our solution, there is a suspicion that the convergence holds for non linear equilibria as well. 
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develop an application in the context of the Blanchard [12] model of economic growth 
with finite life (the perpetual youth model). The Blanchard model is a continuous 
time version of the overlapping-generations models of Samuelson [55j and Diamond 
[23] offering more tractability for the aggregation of individual variables. We explore 
the dynamic allocation problem of a utilitarian government (the planner) maximizing 
the discounted sum of the surviving and unborn generations' lifetime utilities. As we 
mentioned earlier, implementation of utilitarian welfare optima in overlapping gener- 
ation models is problematic. Except in few special cases, utilitarianism renders the 
planner's objective time inconsistent. Time consistency is achieved for example when 
the planner's discount rate is equal to the individual discount rate or if the planner's 
discount function satisfies some unnatural assumptions (Calvo and Obstfeld [H]). 
The purpose of our application is to analyze the centrally planned allocation when 
the social discount function is aligned with the way in which economists usually model 
utilitarianism that is, an exponential forward looking discount function with a con- 
stant rate of growth. The optimal command is then time inconsistent and the planner 
is forced to approach the allocation decision as a strategic problem. Our principle 
of intergenerational equity is then anchored in the concept of subgame perfection 
of the extensive form game of successive planners. This approach has already been 
taken for Rawlasian welfare (see Dasgupta [21], Lane and Mitra [16] and, Asheim 
[3]). To our knowledge, our application is the first to address this preference based 
time inconsistency friction in an overlapping generations economy with a utilitarian 
government. 

We now summarize the main finding from our application (section 5) that we 
carry out in two steps. 

First, we describe how the centrally planned economy operates. The government 
has to solve two questions: how to allocate consumption across the surviving cohorts 
and how to allocate aggregate consumption over time. The inspection of this dual 
decision shows that when the allocation across surviving generations is restricted to 
be stationary and linear in aggregate consumption, the problem of allocating the 
aggregate consumption over time is in fact isomorphic to an infinitively lived repre- 
sentative agent growth model with non constant discount of the type analyzed in the 
first part of the paper. The social discount function takes the form of a mixture of two 
exponential functions. Due to this special structure we can reduce the equilibrium 
characterization from the first part of the paper to a system of two coupled ordinary 
differential equations which do not involve non local terms. Taking advantage of this 
special structure and using the central manifold theorem (Carr [IT]), we then prove 
the existence of multiple equilibria. The equilibrium policies are continuous and dif- 
ferentiable in the capital stock and they inherit the smoothness (in a sense to be made 
precise) from the underlying production function. Equilibrium multiplicity results in 
a continuum of possible steady state level of capital stock within the range of an 
open and bounded interval. When the planner's discount rate is equal to the individ- 
ual discount rate, the interval shrinks to one point, and the capital stock converges 
to its modified golden rule level. The driving factor for multiplicity is the govern- 
ments' inability to coordinate their expectations on any policy. When adapted to our 
continuous time framework, renegotiation-proofness is restrictive as in Kocherlakota 
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[38] . Under this refinement, all successive governments use in agreement, amongst all 
equilibrium policies, the one that induces the capital stock to converge to the highest 
steady state level. The renegotiation-proof steady state level of capital stock depends 
on the government discount rate, the private discount rate and the individuals' life 
expectancy. When the government discount rate converges to 0, the capital stock 
path resulting from the renegotiation-proof equilibrium converges to its golden rule 
level. 

Second, we discuss how to find a tax schedule that, in a market with actuarially 
fair annuities (Blanchard [12]) places the economy on the desired disaggregate path 
of accumulation. If the social discount rate is equal to the private discount rate, dis- 
tortionary taxation is not required by the second welfare theorem. If, as we suppose 
in our application the government is more patient than the individuals0 the time 
inconsistency problem creates a wedge between the cost and benefits of saving at 
the individual level. Under this assumption, the laissez-faire economy cannot decen- 
tralize the allocation and we show that such decentralization can be achieved if the 
government uses a date and age conditioned lump sum taxation and distorting capital 
income taxation. We provide a closed-form expression for the required path of capi- 
tal income tax rates that shows that its long term level can be positive or negative. 
The result is clearest in one specification of the model where the the consumption 
allocation across the surviving cohorts is egalitarian. Under this specification, as in 
the infinitely lived identical agents economy, there is no heterogeneity in consumption 
at any point in time. Yet the government finds it optimal to subsidize the capital 
income with an age independent rate, including in the steady state. The motive for 
the subsidy is to give the savings incentives to the private agents who, from the point 
of view of the government, are not saving sufficiently. The qualitative conclusion 
from our application is that the preference based time inconsistency friction faced 
by a utilitarian government creates in its own a role for capital taxation in the long 
term. The results is in stark contrast with the benchmark infinitively lived agents 
economy where capital income taxation should be zero in the long term (Judd [53] 
and Chamley [IB])- 

Related Literature. Before turning to the model, we summarize how our work 
relates to the literature on which it builds. Our main result is the existence of multiple 
equilibria in a dynamic game. We introduce a definition of continuous time games and 
we prove existence by using a new approach, based on the central manifold theorem 
(Carr [IT]), in a deterministic Ramsey growth model. Our paper naturally relates to 
the game theoretic literature on Markov-consistent plan (MCP). An MCP is a sub- 
game perfect equilibrium of the extensive form game between the successive planners 
in which the strategy is pure and only depends on the payoff relevant variables (capital 
stock), the MCPs do not always exist in time inconsistent decision making problems. 
Peleg and Yaari [5U1_^ constructed a simple finite horizon intra-personal game where 
MCPs do not exist Linear MCPs have however been reported in the hyperbolic 

^See Caplin and Leahy [15j for a non paternalistic argument supporting the idea that social 
planner can be more patient than private individuals. 

^However, more general history dependent equilibrium strategies still exist as Goldman |29] 
showed in a finite horizon setting and Harris [31] showed in an infinite horizon setting 
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discount literature! in a frictionless consumption saving problems with homothetic 
time additive utilities and linear production functions. In a deterministic setting, lin- 
ear MCPs have been reported in Laibson |i43j in a discrete time consumption saving 
problem as well as in Barro [7] in the context of an infinite horizon continuous time 
decentralized version of the Ramsey growth model. Luttmer and Mariotti [48j also 
reported linear MCPs in an infinite horizon endowment asset pricing model with un- 
certainty. The aggregate temporal allocation problem of our overlapping generations 
growth model may be interpreted as a consumption saving problem with hyperbolic 
discount and a non linear state dynamic for wealth. With this interpretation, our 
application can be seen as a an extension of Laibson [13], Barro [7] and Luttmer and 
Mariotti [H] to non linear MCPs. Our results suggest then that the observational 
equivalence results implied by the existence of linear MCPs, does not hold when the 
technology is non linear. In the presence of non linear technology, the MCP dynamic 
path emerging from our analysis are not possible to reproduce with constant discount. 

Non observational equivalence is also present in the infinite horizon buffer stock 
models with income uncertainty, borrowing constraints and hyperbolic discounting 
in discrete time (Harris and Laibson [32]) and continuous time (Harris and Laibson 
[34]). These two papers show existence and uniqueness of smooth MCPs when the 
discount function is sufficiently close to the exponential discount Our results do 
not overlap with theirs since we allow for non linear technology and we do not have 
a borrowing constraint. Methodologically, our proof of existence is different and does 
not require the presence of uncertainty or the discount function to be sufficiently close 
to the exponential function. Our results is also different because multiplicity emerges 
as a central feature of the equilibrium in our context. 

Another type of MCPs existence results is found in the literature on altruistic 
generations growth economies. In this context, existence of MCP has been established 
for a finite horizon setting (Bernheim and Rey [9J) and for an infinite horizon setting 
(Bernheim and Rey [llj) (see also Caplin and Leahy for a recent related MCP's 
existence result under uncertainty). In contrast to our approach, the existence result 
of those papers hinges critically on introducing production uncertainty. 

Our paper also relates to the literature on the Ramsey growth model when the 
central planner himself displays hyperbolic discount. Krusell, Kuru§gu and Smith 
[lO] undertook an elegant comparison of a decentralized and a centralized version of 
the Ramsey model with quasi-geometric discount, in discrete timj"1. The analysis 
in Krusell, Kuru§gu and Smith [ID] does not cover the fundamental issue of multiple 

^This literature shows that apparent irrationahty of individuals, even in financial markets, can 
be ascribed to the fact that the psychological discount factor is not exponential; see Laibson [45] . 
Harris and Laibson [33], Diamond and Koszegi f25] and others. 

^°When the discount function is quasi-geometric (Phelps and PoUak [S2]) and sufficiently close to 
a geometric one, Harris and Laibson [32. used tools from the bounded variations calculus to Show 
existence and uniqueness of the equilibrium strategies. Using alternative techniques, Harris and 
Laibson [34j proved existence and uniqueness of smooth equilibria for a stochastic discount function 
when it is in some sense sufficiently close to the exponential discount function (this is what they coin 
the instantaneous gratification model), ft appears that both proofs require the presence of income 
uncertainty. 

^^See also a related paper by Judd [35] . 
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equilibria because it is assumed that the equihbria must be the hmit of finite-horizon 
equihbria. We do not impose this restriction in our setting and the infinite horizon 
naturally underscores multiple equilibria. Multiplicity is also discussed in Karp ^36] 
and Krusell and Smith [41j. In a continuous time model , Karp [36] obtained the 
MCP's necessary conditions in a growth model with non constant discount rate by 
considering first the equilibrium of a sequence of planners in discrete time and then 
he took the continuous time limit. While the passage to the limit is not mathemat- 
ically justified in Karp [36], his equilibrium necessary conditions are consistent with 
Our incremental contribution relative to his is that we took the necessary theoretical 
steps to define the novel notion of continuous time game and, as a byproduct, we 
proved existence of multiple equilibria. Krusell and Smith [H] also report multiple 
equilibria in a Ramsey growth model with quasi-geometric discount. However, their 
MCPs are supported by discontinuous consumption policies whereas our MCPs are 
continuously differentiable policies. Therefore, our analysis suggests that the mul- 
tiplicity is somewhat more fundamental because it does not need to be structured 
around discontinuous saving rules. 

Finally, our application also relates to the literature on optimal fiscal policy. In the 
context of a growth model with infinitely-lived individuals, Judd [35j and Chamley 
[T8] established that capital taxation should be zero in the long term. In contrast 
to these result, we suggest that in overlapping generations economies, the preference 
based time inconsistency friction faced by the government creates a role for long 
term capital income taxation]^ However, our model ignores important aspects of the 
governments policy tradeoffs that could result in a myriad of alternative motives for 
capital income taxation. For example, the Mirrlees approach to optimal taxation (see 
the papers surveyed in Kocherlakota [22]) taught us that taxing capital income can 
be required if the planner is facing an informational friction due to the unobservable 
private skills or productivity. In the closer (non Mirrleesian) context of overlapping 
generation, Erosa and Gervais [26] rationalize capital income tax as an indirect leisure 
tax. Nonetheless, we aimed to argue that the preferences based time inconsistency 
faced by the government by itself creates a role for capital income taxation and we 
hope that the point is clearest in our simple context. 

The rest of the paper is organized as follow. The next section presents the basic 
model and discuss the issue of time inconsistency. We define the continuous time game 
in Section 3 and provide the equilibrium characterization in Section 4. In Section 5 
we undertake our application to the overlapping generations model. The last section 
concludes. 

""^^The literature on optimal governmental policy with overlapping generations includes Diamond 
[24] , Atkinson and Sandmo [4] , Auerbach and KotlikofF [5] , Erosa and Gervais [26] . 
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2 The model 



2.1 Preferences and production 

We consider a deterministic stationary environment where time is continuous. A 
decision maker derives utility from a consumption schedule rate c, and the date t 
utility has the representation 

/■oo 

h{s -t)u{c{s))ds (1) 

for some utility function u and some discount function h. We assume that u is strictly 
increasing, twice continuously differentiable and strictly concave. The discount func- 
tion h is continuously differentiable and positive, with h (0) = 1, h (t) > 0, and 
Jq°° h{s)ds < oo. Note that the discount function between the current time t and the 
consumption scheduling time s depends only of s — t. This assumption implies, as 
in Strotz [58], that the incremental utility of immediate over postponed consumption 
remains invariant with the passage of time. As a result, the representation ([1]) is 
stationary, meaning that 

POD 

h {s — t) u {c (s)) ds = / h (s) u {c{t + s)) ds. 

Jo 

The decision maker strives to maximize the objective ([1]) under the resource constraint 

"^^^'^ f{k{s))-c{s), k{t) = h, (2) 



ds 

where k{s) is capital at time s and, where / is a strictly increasing, concave and 
continuously differentiable production function. 

2.2 Time consistency 

Unless the discount function is exponential, the marginal rate of consumption sub- 
stitution between two future dates will in general change with the mere passage of 
time. To see this, fix the dates ti < t2 < ^3 < ^4 and the consumption rates C3, C4 
and consider the marginal rate of substitution (MRS) between consuming C3 at date 
is and consuming C4 at date ^4. When calculated from the perspective of date ti, the 
MRS is whereas the MRS is Ifr'A^'u^'l from the perspective of date ^2- 

h{t4—tl)u' (04) h{t4~t2)u'(C4) f f z 

The two MRS will be identical if the discount function has the multiplicative property 
= hftl-l^J) ^^^^ niust hold for all ti < ^2 < ^3 < ii- As Strotz [58j pointed 
out, a necessary and sufficient condition for the MRS to be time invariant is that the 
discount function of the exponential form h{t) = e~^* for some constant discount rate 
S > 0. The MRS changes with the mere passage of time with any other discount 
function, and intertemporal consistency fails. As a result, when the discount function 
is not of the exponential form, a consumption schedule (co(s))g>Q seems optimal for 
a decision maker who maximizes the objective ([T]) under the constraint ([2]) at time 
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t = and yet it will not be perceived as such at later time t > 0. So when time 
t comes, there is no reason to expect that the decision-maker will actually consume 
co(t), as the decision-maker at time expected of her, unless, of course, the latter 
has a way to commit the former. In other words, for general discount functions, 
there are a plethora of temporary optimal policies: Each of them will be optimal 
when evaluated from one particular point in time, but will cease to be so when time 
moves forward. As a result, none of them can be implemented, unless one of these 
viewpoints is given a privileged status and the power to lock in policy for all future 
times (which, incidentally, may be regretted afterwards). 

In the absence of a commitment technology, the problem of maximizing ([1]) under 
the constraint ([2]) can no longer be seen as a classical optimization problem. There is 
no way for the decision-maker at time to achieve what is, from her point of view, the 
first-best solution of the problem, and she must turn to a second-best policy. Defining 
and studying such a policy is the first aim of this paper. The path to follow is clear 
and it is consistent with Strotz [58], PoUak [53] and Peleg and Yaari [50]. The best 
the decision-maker at time t can do is to guess what her successors are planning to 
do, and to plan her own consumption c (t) accordingly. We will then be looking for a 
subgame-perfect equilibrium of a certain game. 



3 Equilibrium strategies: construction and defini- 
tion 

We now proceed to define subgame-perfect equilibrium strategies. Paralleling Au- 
mann [6] we will consider the continuum of decision makers over the time interval 
[0, oo). At any time t, there is a decision-maker who decides what current consump- 
tion c (t) shall be. As is readily seen from the equation ([2]), changing the value of c at 
just one point in time will not affect the trajectory. However, the decision-maker at 
time t is allowed to form a coalition with her immediate successors, that is with all 
s G [t, t + e], and we will derive the definition of an equilibrium strategy by letting 
e — > 0. In fact, we are assuming that the decision-maker t can commit her immediate 
successors (but not, as we said before, her more distant ones), but that the commit- 
ment span is vanishingly small. We now construct and define the equilibrium Markov 
strategies. We analyze the problem from the perspective of the decision maker at 
time t = but, given the stationarity of the environment, a similar analysis can be 
carried out at any time t > 0. 

We restrict our analysis to Markov strategies, in the sense that the policy depends 
only on a payoff relevant variable, the current capital stock and not on past history, 
current time or some extraneous factors. Such a strategy is given hj c = a (k), where 
a : R ^ R is a continuously differentiable function. If we apply the strategy a, the 
dynamics of capital accumulation from t = are given by: 

rik 

^ = f{k{s))-a{k{s)), k{0) = ko 
as 

We shall say a converges to k, a steady state of a, if k (s) — > k when s — > oo. 
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when the initial value ko is sufficiently close to k. A strategy cr is convergent if 
there is some k such that a converges to k. In that case, the integral ([T]) is obviously 
convergent, and its successive derivatives can be computed by differentiating under the 
integral. This assumption is not required but it will greatly simplify the exposition, 
and for this reason we will restrict our attention to convergent strategie^. Note that 
if a converges to k, then we must have / (k) = o (fc) . 

Let us now proceed to the definition of equilibrium strategies. A convergent 
Markov strategy c = a (fc), where a : i? — i? is a continuously differentiable function, 
has been announced and is public knowledge. The decision maker begins at time 
t = with capital stock k. If all future decision-makers apply the strategy o", the 
resulting capital stock k^ future path obeys 

r\k 

^ = f(koit))-aikoit)), t>0 (3) 
at 

ko (0) = k. (4) 

We suppose the decision-maker at time can commit all the decision-makers in 
[0, e] , where e > 0. She expects all later ones to apply the strategy a, and she 
asks herself if it is in her own interest to apply the same strategy, that is, to consume 
a (k). If she commits to another bundle, c say, the immediate utility flow during [0, e] 
is u (c) e. At time e, the resulting capital will be A; + (/ (k) — c) e, and from then on, 
the strategy a will be applied which results in a capital stock kc satisfying 

Hk 

^ = f{k,{t))-a{k,{t)), t>e (5) 
k,ie) = k+ifik)-c)e. (6) 

The capital stock kc can be written as kc (t) = ko (t) + ki (t) e where 

dk^ 

-^ = {f'{ko{t))-a'ikomki{t), t>e (7) 
ki{6) = a{k)-c (8) 

where /' and a' stand for the derivatives of / and a. Summing up, we find that 
the total gain for the decision-maker at time from consuming bundle c during the 
interval of length e when she can commit, is 

POO 

u{c)e+ / h{s)u{(r{ko{t)+eki{t)))dt, 



^^In fact, we can work with a larger set of policies a for which the integral ([T]) is convergent, and 
for which the successive derivatives of ([T]) can be computed by differentiating under the integral. 
The resulting equilibrium characterizations of Section |4] will be identical. 

^■^To see this, plug kc (t) = ko (t) + ki {t) e into ([5]) for t > e, keeping only the terms of first order 
in e, and get 

dk 

-^-fih (t)) + ef (A:o (t)) h (t) ~ <J {ko (t)) ~ ea' {h (t)) h {t) • 



Comparing this with ^ gives Q. Equation ([5]) is obtained by substituting the expansion (a) = 

dkg 
ds 



k + e^{0) = k + sifik)-a (fc)) into dH). 
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and in the limit, when e ^ 0, and the commitment span of the decision-maker 
vanishes, expanding this expression to the first order leaves us with two terms 



h{t)u{a{ko {t)))dt 
u{c)-u {a{k)) + / h{t) u' {a {ko (t))) a' {ko (t)) k^ {t) dt 



+ e 



where ki solves the linear equation 



(9) 



dk^ 

= (/' {ko{t)) - a' {ko{m k,{t), t>0 (10) 
ki {0) = a {k) - c. (11) 

Note that the first term of iQ does not depend on the decision taken at time 0, 
but the second one does. This is the one that the decision-maker at time will try 
to maximize. In other words, given that a strategy a has been announced and that 
the current state is k , the decision-maker at time faces the optimization problem: 

max Pi {k, a, c) (12) 

c 

where 

POO 

Pi{k,a,c) = u{c)-u{a{k))+ h{t)u' {o {ko{t)))o' {ko{t))ki{t)dt. (13) 



In the above expression, ko {t) solves the Cauchy problem ([3]),(jl]) and ki {t) solves the 
linear equation (fTOj) . (ITT]) . 



Definition 1 A convergent Markov strategy a : R R is an equilibrium strategy 
for the intertemporal decision model (Q)) under the constraint ^ if, for every k & R, 
the maximum in problem [W\) is attained for c = a {k): 

a {k) = arg max Pi {k, a, c) (14) 

c 

The intuition behind this definition is simple. Each decision-maker can commit 
only for a small time e, so he can only hope to exert a very small influence on the 
final outcome. In fact, if the decision-maker at time plays c when he/she is called 
to bat, while all the others are applying the strategy a, the end payoff for him/her 
will be of the form 

Po {k, a) + ePi {k, a, c) 

where the first term of the right hand side does not depend on c. In the absence 
of commitment, the decision- maker at time will choose whichever c maximizes the 
second term ePi {k,a,c). Saying that a is an equilibrium strategy means that the 
decision maker at time will choose c = a {k). Given the stationarity of the problem, 
if the strategy c = a {k) is chosen at time 0, it will be chosen at any future time t 
and as a result, the strategy a can be implemented in the absence of commitment. 
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Conversely, if a strategy a for the intertemporal decision model ([I]), ([2]) is not an 
equilibrium strategy, then it cannot be implemented unless the decision-maker at time 
has some way to commit his successors. Typically, an optimally committed strategy 
will not be an equilibrium strategy. More precisely, a strategy which appears to be 
optimal at time no longer appears to be optimal at times t > 0, which means that 
the decision-maker at time t feels he can do better than whatever was planned for him 
to do at time 0. What happens then if successive decision-makers take the myopic 
view, and each of them acts as if he could commit his successors ? At time t, then, the 
decision-maker would maximize the integral ([I]) with the usual tools of control theory, 
thereby deriving a consumption c= {t, k). This is the naive strat egy (O'Donoghue 
and Rabin [H]); in general it will not be an equilibrium strategjO, so that every 
decision-maker has an incentive to deviate. 

4 Characterization of the equihbrium strategies 

The equilibrium strategy can be fully specified by a single function, the value function 
V (k), which is reminiscent of- although different from - the value function in optimal 
control. We will show that the value function satisfies two equivalent equations, the 
integrated equation (IE) and the differentiated equation ( JT9l) . the latter one resem- 
bling the classical Hamilton- Jacobi-Bellman (HJB) equation of optimal control. This 
similarity is reassuring since it shows how standard methods from control theory can 
be adapted to analyze the impact of time inconsistency. Unfortunately, the similarity 
is superficial only, since ( fT9|) is a non-local equation (and not a partial differential 
equation like (HJB)) and we will demonstrate that its solutions exhibit different 
qualitative behavior. In the knife edge case where the discount rate is constant, the 
non local term in (fT9|) collapses, and (fT9|) becomes identical to (HJB). Consequently, 
when the discount rate is constant, the equilibrium strategies are also optimal from 
the perspective of all temporal decision makers. 

Given a Markov strategy a (k), continuously differentiable and convergent, we shall 
be dealing with the Cauchy problem Q, (jlj). The value /cq (t) depends on current 
time t, initial data k, and the strategy a. To stress this dependence, it is convenient to 
write ko (t) = /C (a; t, k) where /C is the flow associated with the differential equation 
(IH]) defined by 



The following theorem characterizes the equilibrium strategies and its proof is 
given in Appendix |Al There are two parts in the equilibrium characterization: a 
functional equation on the value function and an instantaneous optimality condition 

^^Under non-constant discounting, the commitment strategy dn is non stationary and so we must 
extend the definition of the equilibrium to a non stationary strategy. Although, we do not report the 
definition of the non stationary equilibrium, it can easily be done with some additional notations. 



OK {(r;t,k) 
di 



/(/C {a;t,k))-a{IC {a;t,k)) 



(15) 
(16) 
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determining current consumption. Here v' is the derivative of v and i is the inverse 
of marginal utihty u' . As usual, i o v'{x) = i [v' (x)) 



Theorem 2 Let a : R R be a continuously dijferentiable convergent strategy. If a 
is an equilibrium strategy, then the value function 



oo 



v{k):= h{t)u{a{IC{a;t,k)))dt (17) 
Jo 

satisfies, for all k, the functional equation 

POO 

v{k)= I h{t)u{iov' {lC{iov']t,k)))dt (IE) 
Jo 

and the instantaneous optimality condition 

u' {a (k)) = v' {k) , a (k) = i {v' (k)) (18) 

Conversely, if a function v is twice continuously differentiable, satisfies ( TIE]) , and 
the strategy a = iov' is convergent, then a is an equilibrium strategy. 

The instantaneous relation f|T8|) expresses the usual tradeoff between the utility 
derived from current consumption and the utility value of saving. This is a standard 
condition in a world where there is one commodity that can be used for investment 
or consumption. Let us spell out what equation (1IE|) means. Given a candidate 
function f, we must first solve the Cauchy problem (ITSl) . (fT6|) with a = iov' . Second, 
we calculate the right-hand side of equation (llEl) . which is an integral along the 
trajectory of capital stock. The final result should be equal to v{k). Equation (llEl) 
is therefore a fundamental characterization of the equilibrium strategies and it takes 
the form of a functional equation on v. In order to contrast the equilibrium dynamics 
with the dynamics resulting from using the optimal control approach, the following 
proposition gives an alternative characterization, the differentiated equation, which 
resembles the usual Euler equation from optimal control and its proof is given in 
Appendix B. 

Proposition 3 Let v be a function such that the strategy a = i o v' converges to 
k. Then v satisfies the integrated equation (IE) if and only if it satisfies the following 
functional equation 

oo 

h\t)u o i {v\lC {i o v'] t, k)) dt = sup [m(c) + v\k) {f{k) - c)] (19) 

c 

together with the boundary condition 

POO 

v{k) =u{f{k)) / h{t)dt (20) 
Jo 
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It is useful to rewrite the differentiated equation (fT9|) as 



p{k) = ^ sup He) + v'{k) ifik) - c)] (21) 

where 

^(^^ ^ /o°°^^(t)7/(a(;C(a;.,A;)))rft 
Jq h (s) u {(T {K. {(t; s, k))) ds 

is interpreted as an effective discount rate. Equation fl2Tl) then tells us that, along an 
equilibrium path, the relative changes in value to the consumer must be equal to the 
effective discount rate. The effective discount rate is here endogenous to the model and 
its presence reflects the strategic behavior of the current decision maker resulting from 
internalizing the behavior of future decision makers. In order to gain some insights 
into the economic meaning of equation (fT9|) , we first consider the exponential discount 
function h(t) = e~^°^ for 5q > 0. With exponential discounting, h'(t) = —Soh{t), and 
the resulting effective discount rate is just the constant discount rate p{k) = 6q for 
all k. Equation f|T9|) becomes then simply the familiar (HJB) equation 

6ov{k) = sup [u{c) + v\k) ifik) - c)] . (22) 

c 

Second, we consider the case where h is piecewise exponential, h(t) = e~^°* for t < t 
and h(t) = e~^^^ for t > t, with r > and Sq > 5i. The discount rate of this 
discount function is decreasing with time and therefore the willingness to postpone 
consumption at the margin is increasing over time. When the decision maker is naive 
in the sense that he acts as if he could commit the future decision makers and does 
not learn from his past mistakes, his behavior can be described by the (HJB) equation 
fl22]) . In contrast, a decision maker following the equilibrium strategy recognizes that 
future decision makers will spend more than he currently hope and, in reaction to 
that, he may accumulate more wealth than the naive decision maker. Equation (fT9l) 
reflects exactly this idea since, when the discount function is piecewise exponential, 
it can be written as 

Sovik) = sup [uic) + (5o - 5i) g{k) + v'ik) ifik) - c)] (23) 

c 

where 

poo 

gik)= / e-^'^uoiiv'i}Ciiov';t,k))dt. 



The only difference between the naive policy characterization (122!) and the equilibrium 
policy characterization (123|1 is the extra term i6o—Si)gik). Assuming that g is positive 
and increasing in k, it is then evident that the presence of the extra term (5o — S-\)q(k) 
yields additional incentives to save, relative to the naive policy (where (7 = 0)0 



""^^To be more concrete, assume further that the technology is hnear, /(/c) = Ak and the utihty of 
the form u{c) = log(c). Solving ((22|) gives the naive policy 

cr„(fc) = Sok 
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Neither equation (lIEp nor equation (fT9|) are of a classical mathematical type. If it 
were not for the integral term, equation (fT9l) would be a first-order partial differential 
equation of known type (Hamilton- Jacobi), but this additional term, which is non- 
local (an integral along the trajectory of the flow (|T5l) associated with the solution 
v), creates a loss of regularity in the functional equation that generates mathematical 
complications. As a result, existence and uniqueness problems arise as they typically 
do in dynamic games. The topic requires more scrutiny. The next section applies the 
method in the context of an overlapping generation model where time inconsistency 
is typically faced by a utilitarian government. 



5 An overlapping generations growth model 
5.1 The model 

Structure of the population. We consider a continuous time overlapping generations 
model of growth analysis, along the lines of Blanchard (1985). The economy is com- 
posed of overlapping generations of finitely-lived individuals who face a constant rate 
of death tt > Oo At time s, the probability of surviving until time t > s is given by 
g-7r(i-s) consequently, the expected life is t7re~'^^^~^^dt = K At each instant a 
large cohort of identical individuals is born at a normalized rate of 1 so that the total 
number of individuals born during a small time interval [^1,^2) is ^2 — ^i- Because the 
cohort is large, there is no uncertainty on how the cohorts's size and the total pop- 
ulation vary over time. A cohort born at time r (the r-vintage) has a geometrically 
declining size which, as of time t > r, is equal to e~'^^^~'^\ At each point of time t, the 
size of the population is constant and it is given by jl^e-^^^-^Ur = i. The time-t 
expected lifetime utility of a vintage-r individual (r < t) is, as in Yaari [50] . 

/oo 

where 5 > is the constant pure rate of time preference and c(r, s) is the consumption 
rate of an individual born at time r, as of time s >t. The utility function of a newly 
born individual from vintage— r is then 

r(r,r)= / e-(^+")(^'")ln(c(r,s))cis. 



whereas solving ((23|) gives the function g{k) = j^e '^'^'^ log (fc) + <j where c is a constant, and the 
resulting equilibrium policy is 

This example illustrates the strategic motive of saving since the equilibrium marginal propensity to 
consume is low relative to the naive policy. 

^^As mentioned by Blanchard [12], the individual's rate of death can also be interpreted at the rate 
of extinction of a dynasty. With this interpretation, the perpetual youth assumption [p constant) 
seems more acceptable. The mathematical analysis suggests that the indeterminacy result does not 
require the perpetual youth assumption. 
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Technology. The technology is represented by a constant return to scale pro- 
duction function depending on two factors of production, aggregate capital K and 
aggregate labor. From above, the size of the population is constant and assuming 
further that labor supply is inelastic, the production function (net of depreciation) is 
a continuously differentiable and concave function of aggregate capital stock f{K). 
As in Section [2], since capital and output are the same commodity, capital can be 
invested or consumed and the investment rate is 

^ = fim) - C{t) (24) 

where ^ 

C{t) = I c{T,t)e-^^'-^^dT (25) 

J — oo 

is the aggregate consumption at time t. 

The social criterion. We consider a social planner maximizing a utilitarian crite- 
rion balancing the lifetime utilities of the current population and the unborn genera- 
tions. The planner is concerned with the generations' welfare from the present (time 
to = 0) onward and considers the alive individuals as if they had just been born so 
that the criterion takes the form 

fOO pO / POO \ 

/ e-P^T{T,T)dT+ / e-('^+")Mn(c(r,s))rfsMr. (26) 

Jo J~oD \Jo J 

The first term of the above criteria discounts back to time the expected lifetime 
utility of unborn generations using the social discount rate p > 0. The second term is 
the remaining expected lifetime utility of the individuals who were born in the past 
and are still alive at time 0. Notice the asymmetry of the treatment of the unborn 
cohorts relative the the surviving ones in the criteria (l26ll . The later 's utilities are 
discounted back to current time whereas the former are discounted back to their birth 
date. Unlike the criteria of Calvo and Obstfeld [13] where symmetry is assumed, the 
absence of symmetry in the criteria (126|1 creates a time inconsistency due to the 
dependency of the planner's utility flow on the planning time. To see this, fix the 
planning time to = 0, change the order of integration in 0261) and make the change of 
variable from vintage r to age n = t — t, to get the welfare function 

POO 

/ e-'''{Uc,0,s)+Uc,0,s)}ds. (27) 
Jo 

where ju(c, 0, s) is the time s utility flow attributed to the unborn cohorts defined for 
any t < s by 

ps—t 

ju{c, t,s)= / e-(^+"-'')" ln(c(s - n, s))dn, 
Jo 

and where jb{c,0,s) is the time s utility flow attributed to the surviving cohorts 
defined for any t < s by 

POO 

j^{c, t,s)= / e-(^+"-'')"e-(^-^)(^-*-") ln(c(s - n, s))dn. 

J s-t 
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Alternatively, if the planning time is to = ^ > 0, a similar calculation shows that the 
planner's criterium becomes 

/oo 
e-'^'-'^ {Uc,t,s)+Uc,t,s)}ds. 

When S = p, time consistency obtains because the utility flow from the perspective 
of the planning times to = t becomes 

POO 

ju{c,t,s) + jb{c,t,s) = / e~''"ln(c(s - ra, 

Jo 

and therefore it is independent from the planning time. However, so long as 5 > p the 
planner faces a time inconsistency problem because the utility flow ju{c, t, s)+jb{c, t, s) 
depends explicitly on the planning time to = t- 



5.2 The centrally planned economy 

Beginning with a level of capital K{0) at the planning time to = 0, the planner 
maximizes the criterion (l27|l under the budget constraints and (!25|) . It is useful 
to partition the planner's problem into two tasks each of which will take the other 
as given. These two tasks are executed by two flctitious planners, the intra-period 
planner and the metaplanner. The intra-period planner takes as given the aggregate 
consumption and is in charge of allocating the aggregate consumption across the 
surviving cohorts. The metaplanner on the other hand takes as given the path of intra 
cohorts' allocation rules of aggregate consumption and is in charge of the aggregate 
investment decision over time. Notice that both planners face a dynamic decision 
problem. Let us now describe more formally the task of the intra-period planner and 
the metaplanner. 

At the planning time is to = 0, the intra-period planners commitment allocation 
of C{s) at time s > is obtained by maximizing 

PS POO 

Jo J s 

under the budget constraint C(s) = c{s—n,s)e~'^^dn. The aggregate consumption 
expenditure C{s) is exogenous to the intra-period planner and the optimal commit- 
ment allocation is 

g(p-(5)n 

c(s - s) = 7r((5 + vr - p) — — —-. — rCis), for n < s, (28) 

g(p-<5)s 

cis — n,s) = Ti{5 + Ti — p) r — 77- ^C(s), for n > s. (29) 

^ ' ^ ^' -K + {5 - p)e-^^+'^-P> ^' ^ ' 

This allocation rule is however not robust to re-optimization because at the planning 
time to = t > 0, the intra-period planner's new commitment allocation is in general 
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not aligned with the allocation that the planner committed to at time to = 0. For 
example, at the planning time to = the intra-period planner allocation is 



Q,=o(t, t) = n{6 + n- p)—-^^—-^-^-^^^C{t) 

for the cohort which is born at time t, but from the perspective of the planning time 
to = t, the commitment allocation for the same cohort is the egalitarian allocation 

Ct,=t{t,t) = irC{t). 

In order to focus on a simple class of equilibria, we restrict our analysis to the set 
of linear and stationary allocation rules for the intra-period planner of the form 

c{t-n,t) = (p{n)C{t). 

with 

" oo 



e~^''^{n)dn = 1. (30) 
Under this assumption, the objective fl27|) at the planning time to = becomes 








/oo 
^^nn^~(5~p)s ^^^^^^ ^ 




where L{s) is a function depending only on time s and the aggregate consumption 
C{s), which are exogenous and therefore, it drops out of the intra-period planner's 
problem. Integrating by part this formula, shows that the intra-period planner's 
objective at time to = is given by 

When undertaken from the perspective of the planning time to = t, the same calcu- 
lations shows that the intra-period planner's objective is identical with the objective 
fl3T|) . Therefore, when restricted to stationary linear allocation rules, all intra-period 
planners agree to use the rule ip{-) that maximizes (13 ip under the constraint fl5U|) . 
The solution to this maximization problem gives the allocation rule 

^(s) = ^^-±^ {{6 - p)e-'^ + p) . (32) 
d p + IT ^ 

How important is the assumption of stationary and linear allocation rule? The as- 
sumption of stationarity of the allocation rule is consistent with our view that the 
planner's problem in invariant with the passage of time. The assumption of linearity 
is more questionable because the equilibrium may require, for instance, to have a less 
egalitarian consumption sharing rule in prosperous periods. If we permit non linear 
allocation rules of the form c(t — n,t) = (f{n, C{t)) we will in principle have to cope 
with the time inconsistency of the intra-period planner, the time inconsistency of the 
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metaplanner (which, as we will show shortly, arises even with linear allocation rules) 
and, moreover the (sequential) interaction between the intra-period planner and the 
metaplanner. We do not attempt to address this question fully. Instead, assuming 
linear and stationary allocation rules allowed us to work out a simple example of 
second best equilibrium where the successive intra-period planners agree on the best 
allocation rule and where the intra-period planner's problem is decoupled from the 
metaplanner's problem!^ 

Plugging either the optimal consumption allocation (l32l) (or the egalitarian allo- 
cation) in the planner's criterion (1271) and calculating the resulting integrals yields 
the time s utility flow to the metaplanner 

ln(C(s)) + — ln(C(.)) + M{s) 

TT -|- TT — p 

where M(s) is a function depending only on the variables that are exogenous and 
therefore, it drops out of the metaplanner's problem. Substituting the utility flow into 
the criterion fl27|) and dropping the function M allows to express the metaplanner's 
criterion at the planning time to = solely in terms of the aggregate quantities 

/ ln(C(s))rfs + / ln(C(s))rfs (33) 

>• ^ ' >• ^ ' 

Welfare allocated to surviving cohorts Welfare allocated to unborn cohorts 

After rearranging and normalizing the welfare equation (133|) . the metaplanner's ob- 
jective at the planning time to = is 

\ \xv{C{s))ds. (34) 

Jo d + TT - p 

A similar calculation shows that the metaplanner's objective from the perspective of 
the planning time to = t is 

^ , ln(C(s))ds. (35) 

+ TT — p 

The metaplanner is therefore facing a time consistency problem because the marginal 
rates of substitution between consumption at two dates in the future is changing 
by the mere passage of time. This is exactly the type of time inconsistency that we 
discussed in the sections 2 to 4. In the present context, time consistency is endogenous 



^^Different assumptions on the behavior of the intra-period planner are also possible and will lead 
to an identical dynamic problem for the metaplanner. For example, if we assume that the intra-period 
planners are naive, in the sense that they do not internalize their time inconsistency problem, the 
egalitarian policy allocation rule c(t — n,t) ~ nC{t) will prevail. The metaplanner's problem is then 
identical to the one that results from using the allocation rule The metaplanner's problem also 
remains intact if we assume that the intra-period planner can commit to never change the allocation 
rule decided at time = 0. Under this assumption, the intra-period planner's optimally committed 
allocation rule is the non-stationary and age dependent allocation rule (l28l) . ((29|) . 



19 



since it is created from the structure of the criterion (126!) which asymmetrically treats 
the surviving cohorts (whose lifetime utility is discounted back to current date) and 
the unborn cohorts (whose lifetime utility is instead discounted back to birth date). 
Our planner is therefore different from the time consistent planner of Calvo and 
Obstfled (1998). 



5.3 Equilibrium strategies for the metaplanner 

As in Section HI we denote by v the equilibrium value for the metaplanner. We are 
in the special case when u (c) = In c and: 

h it) = e-(^+-)* + ^ e~^* (36) 

Due to the specific form of discounting in it is possible to rewrite the equi- 
librium characterization as a system of two differential equations for two functions, 
the equilibrium value v, and a function w which determines how the welfare is split 
between the surviving cohorts and the unborn cohorts. The next proposition states 
this result and its proof is given in the appendix. 

Proposition 4 Let a be a continuously differentiable strategy converging to k. If a 
is an equilibrium strategy, then the functions v and w defined by 

v{k) = r ( e-(^+-)* + e'A In {a (/C (a; t, k))) dt, (37) 

Jq \TT + d-p n + d-p J 

Hk) = -^j—J (-e-(^+")* + e-^*)ln(a(/C(a;t,A;)))rft (38) 



satisfy the system 



(^f - ^) ^' - In W) =6v-{5-p) w, (39) 



f - — ] w' = -nv + {p + n)w (40) 



V' 



with the boundary conditions 



(k) = / In / m (42) 

^ ^ p{d + n) ^ ^ 



w 



and the strategy a is given by oilz) = l/v'{k). 

Conversely, let v be a C"^ function such that the strategy a{k) = l/v'{k) converges 
to k. If there exists a function w, such that {v,w) satisfies the system ^3^-1^4^ 
and the boundary conditions (f7?lj-(f^^, then a is an equilibrium strategy converging 
to k 
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Note that, by Proposition [3l the characterization (l39!) - (H0l) is equivalent to the 
more general equation (fT9l) . Thus, when the discount function has the special form 
( |36l) . the non local one dimensional equation (fT9l) becomes a system of two ordinary 
differential equations without non local terms. This reduction is critical for our exis- 
tence result in the next theorem. The proof of Proposition H] is given in the appendix. 

Using the welfare decomposition fl33l) . it can be shown that, given the current 
level of capital K{0) = k, the criterion fl34l) allocates the welfare v{k) — w{k) to 
the surviving cohorts whereas the unborn cohorts' welfare is w{k). To build some 
intuition on the system ( l39l) . ( HOl) . it is useful to consider the case where p = 6, 
in which case equation (136!) shows that the metaplanner is facing a standard time 
consistent investment problem with a constant discount rate S. As a result, the 
system {v, w) is uncoupled in the sense that v can be solved for without knowing w. 
In this case, differentiating fl39p . gives the classical autonomous dynamical system 
describing the evolution of the variables {K, C) 



dt 
dC{t) 



C{t) dt 



When the metaplanner's problem is time inconsistent, the system [K, C) is not 
autonomous anymore and an additional variable W{t) = w{K{t)) is required to de- 
scribe the equilibrium dynamics. Using fl5^ . fHUl) the evolution of the economy can 
be described by the autonomous system {K, C, W) 



dK{t) 
dt 



f{K{t))-Cit), (43) 



1 ^^W-/'(;r(t))_5_(5-p)" 



C{t) dt ^ ^ ^ ' S 

6-p C{t) 



6 f{K{t))-Cit) 



7r\n{C{t))-p{7r + S)W{t)), (44) 



^^.»^ + .ln(C(t))-p(. + .)l^(t).) (45) 



dt 6 \ Cit) 

The dynamics of interest are the one where the economy naturally tends to a 
steady state, that is a condition of the economy in which the aggregate level of 
output, capital and consumption do not change over time. The following theorem 
provides our main result on existence of multiple steady states 

Theorem 5 Consider any k E I where / = {A; | < f'{k) < 6}. Then there exist 
an equilibrium strategy, defined on some neighbourhood flofk and converging to k. 

This result shows the existence of a continuum of steady states of the economy. 
The result is proved in the appendix and it is obtained by using the Central Manifold 
Theorem [T7|. Associated to the multiple steady states is a continuum of equilibrium 
strategies each of them generating a path of aggregate capital stock. The equilibrium 
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multiplicity is expectation driven in the sense that if all metaplanners agree that 
ki E I then it will and if all metaplanners agree that /c2 € / is a steady state, then 
it will as well. The multiplicity occurs because the subgame perfection condition is 
not sufficient to pin down how the metaplanners coordinate their beliefs on splitting 
the resources between the surviving cohorts and the unborn ones. When the social 
and private discount rate are equal, the interval I becomes one point and the econ- 
omy's capital stock converges then to its modified golden rule level k"^^ defined by 
/' (fc*"^) = 5. The next step is then to evaluate the efficiency of these equilibria. In 
particular, one may wonder if conservative equilibrium policies (i.e. those generat- 
ing higher steady states level of capital and consumption) Pareto dominate the less 
conservative equilibrium policies. The distant unborn generations clearly favor the 
conservative equilibria since the steady state consumption is larger. However, the 
preference of the currently surviving cohorts is ambiguous because current consump- 
tion must decrease in order to achieve a higher steady state of capital and as a result, 
welfare may be lower. While evaluating efficiency at this level of generality does not 
seem obvious, equilibrium ranking is nevertheless possible if one allow the metaplan- 
ners to revise their belief at later stages of the game. So far, we implicitly ruled out 
renegotiation in our definition of equilibrium policies since beliefs were fixed once for 
all at an ex ante stage of the game. This assumption may be justified when it is very 
costly to renegotiate but it seems inappropriate in our context since it is conceivable 
that the metaplanner may reconsider at future stages their coordination with future 
metaplanners. Following an argument initially introduced by Farell and Maskin [27] 
and Bernheim and Ray ^lOj we illustrate in the next subsection how giving the plan- 
ner the ability to reconsider his beliefs (present and future) - allows to shrink the set 
of subgame perfect equilibria. 

5.4 Renegotiation-proof equilibrium 

In order to select a subgame perfect equilibrium, we use the notion of renegotiation- 
proof equilibrium introduced by Farrell and Maskin [27] in the context of a two players 
repeated game. The notion has been used in a multiple selves infinite horizon game by 
Kocherlakota [38] (see also Asheim [2] for an alternative refinement) and we provide 
a local definition of this notion. 

Definition 6 An equilibrium strategy a converging to k is locally renegotiation-proof 
(l.r.p.) if there exists an open neighborhood Q of k such that for all k* G Q, the 
equilibrium policy a* converging to k* satisfies 



for all fco G fi. 

This definition says that if an equilibrium policy is locally renegotiation-proof 
then, from the perspective of the distant metaplanners, a small perturbation of the 
policy, inducing a shift in the steady state level of capital is dominated by the status 




(46) 
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quo. To see this, notice that since the l.r.p. equihbrium pohcy a is converging to k 
then, for all in the domain of definition of ex, there exists a time T > such that the 
flow (/C (a; t, ko) ,t > T) belongs to VL. Therefore equation (H6l) with ko = )C (a; s, ko) 
says that, for any s > T the metaplanner at time s prefers to keep the strategy 
a rather than switching to a neighboring rule a*. As a result, if the metaplanners 
are restricted to switch policies by small increments rather than by more drastic 
deviations, only l.r.p equilibria are credible. The next proposition says that the local 
renegotiation-proof condition rules out some subgame perfect equilibria and allows to 
identify the steady state of the economy. 

Proposition 7 Any equilibrium strategy a converging to some k satisfying < 
f'{k) is not locally renegotiation-proof: distant metaplanners will always agree to 
switch to a new equilibrium policy converging to k* where k* is slightly above k. 

The implementation of the switch can be done easily. Along an equilibrium path 
converging to k and induced by the policy a, the time T metaplanner (where T is 
sufficiently large) suggest to the successors to switch to a new equilibrium strategy 
a* converging to k* where k* is slightly larger that k. Proposition [7] says that, for 
T sufficiently large, the metaplanners have an incentive to play the new equilibrium 
policy (J* instead of keeping a. In summary, all the future metaplanners (from time T 
onward) coordinate their beliefs on a new equilibrium a* . Since the metaplanner at 
time is aware that this will happen, it does not make sense for her to start playing 
a and as a result, she will use the strategy a* . The same argument can however be 
made iteratively to a* and the future metaplanners will have interest to deviate from 
a* to a neighboring strategy a converging to k where k is slightly higher than k* . The 
metaplanner will then apply an equilibrium strategy converging to the highest steady 
state level of capital defined by /' {k^ = pf^- Consequently, the steady state level 
of capital depends on the planner's discount rate and the individual discount rate. 
The more patient is the planner (lower p), or the consumers (lower 6), the higher is 
the steady state level of aggregate capital stock stock. This result must be contrasted 
with the Calvo and Obstfeld (1988) time consistent government where the the private 
rate of time preference is irrelevant for the steady state level of capital stock. Note 
that when the planner's discount rate p converges to 0, the capital stock converges 
to its golden rule level k^ defined by /' (k^) = 0. 

5.5 Decentralization of the equilibria and fiscal policy 

The government begin at time to = with capital stock K{0) and tries to implement 
the second best allocation that we denote {K*(t),C*{t),W*{t))t>o and such that 
the steady capital stock is K*{oo) = k E I. This centralized allocation path gives 
rise to the real interest rate rl = f\K*{t)) and the real per capita wage = 
{f{K*{t)) - K*{t)f' {K*{t)))-K. We recall that {K\C%W*) solves the autonomous 
system f H5]) . (Hll) and (HSI) and that the resulting disaggregate allocation is given for 
any r < t by 

c\r,t) = ^{t-T)C\t) (47) 
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where the function is defined by ( 132!) . Differentiating equation (H7I) with respect to 
time and using equation (l44l) . shows that the the disaggregate allocation obeys 



1 dc*{T,t) , , ^'{t-r) 

- r* -6 + ijj{t) + f (4^ 



c*(r,t) dt * ^ ^ V5(^ - 

where ip' is the derivative of ip and where 

= - - p) ^ - w) - p (. + . 

When p = 6, the intra-period planner uses the egalitarian allocation rule and the 
metaplanner's problem admits a first best solution since it is time consistent and we 
have (f' = and = 0. As a result, when p = S, the disaggregate consumption obeys 

1 d. c*(r,t) _ * _ r 
c*(r,t) dt 't 

The market economy is as the one of Yaari [60] and Blanchard [!l2j. In the ab- 
sence of bequest motives, the mortality risk creates a role for annuity contracts. A 
competitive insurance market supplies the market with actuarially fair annuities that 
have an instantaneous rate of return of + 7r, where is the market real interest rate 
as of time t. An individual born at time r who had accumulated a positive financial 
wealth a(r, s) at time s > t faces the risk of dying before spending it. Investing in 
the annuity permits this individual to receive a payment (r^ + 7r)a(r, s) is he survived 
at time s and, in exchange, the insurance company will collect a(r, s) if the individual 
dies at time s. On the other hand, if the individual is a net borrower, the lending 
institution faces the risk that the individual dies before being able to repay the loan. 
The individual will issue then an annuity on which a fiow of interest of (rs + 7r)a(r, s) is 
paid to the insurance company but in which the debt will be forgiven if the issuer dies. 
Individuals will always invest (or borrow) their financial wealth with the insurance 
company: If they are lenders, the rate of return is higher with the insurance company 
during their lifetime and if they are borrowers, they have to insure themselves against 
death because negative bequests are prohibited. 

The fiscal instrument at the disposal of the government consists of government, 
age dependent non distorting lump sum taxation (that is an income tax with no slope) 
and age dependent distorting capital income taxes (or subsidy). Denoting by a(r, t) 
the total assets of the vintage r agent at time t, the individual optimization problem 
is 

/oo 
(c(r,s))ds 



under the constraint 
(ia(r, t) 



((1 - r/(r, t))rt + 7r)a(r, t) - c(r, t)+uot + P{t, t) 



dt 

where //(r, t) is the tax rate at time t for an agent born at r and, /3(r, t) is the transfer 
fiow received by a vintage r agent at time t and rj (resp. Ut) is the real interest (resp. 
wage) rate prevailing at time t. The individual optimization problem has the first 
order condition 

1 dc{T, t) 



c{t, t) dt 



{l-r]{T,t))rt-5 (49) 
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for all r < t and yields the policy 



c(r, t) = {6 + tt) [a(r, t) + h{T, t) + b{T, t)] 



(50) 



where /i(r, t) is the human wealth of any individual born a time r as of time t, 



and where 6(r, t) is the present value of government transfer flows expected by a 
vintage r agent as of time t, 



The problem of the government is to credibly commit to a fiscal policy path {(//(r, t); f3{T, t))^<^} 
such that, when the shadow prices {rl}t>o and {u^}t>o are expected, the individual 
optimal consumption paths coincide with the desired allocation that is, 



for alH > and t <t. 

It is impossible to decentralize the allocation c* without government intervention 
because it is a second best inefficient allocation. More precisely, in the absence of 
fiscal intervention [t] = f3 = 0), summing at any date the individual optimization 
policies over all surviving cohorts allows to identify the aggregate dynamics. The 
resulting steady state of the market economy is Jcm where is the unique solution 
of the equation 



not sufficiently accumulate the capital stock relative to the desired centrally planned 
economy. 

In order to decentralize c*, let us first assume that we can find a fiscal policy {t], [3) 
such that the resulting individual consumption matches the desired allocation at time 
0, that is, c(r, 0) = c*(r, 0) for all r < 0. If we want the decentralization to follow 
through for later times t > 0, we must check that c(r, t) = c*(r, t) for any r < and 
t > 0. This condition will hold if and only if the optimality condition ( l49l) is identical 
to the disaggregate allocation dynamics equation (l48l) which results in 




ds 




(51) 



c{T,t)=C*{T,t), 




V{t - r) 



This condition can be expressed as 




(52) 
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where n = t — r is the age of the taxable consumer. Equation (152|) uniquely identifies 
the required capital income tax rate since it is the only distortional tax rate prompting 
the necessary credibility of the taxation policy. 

When the planner's discount rate is equal to the individual discount rate; p = S, 
the time inconsistency disappears, the centrally planned allocation becomes first best 
and the capital income tax rate is zero {ip = if' = 0). When 6 p, the tax policy 
is age dependent. However, if the intra-period planner uses the egalitarian allocation 
rule instead of the allocation rule ( l32l) . the tax rate becomes age independent and it 
given by rjit) = —ip{t)/r^. 

At date t = 0, the government has no flexibility in the choice of the tax rate policy 
T] because it is given by equation (152!) but it will use the lump sum taxation in order 
to match the initial individual consumption c(r, 0) with the disaggregate allocation 
c*(r, 0). To see this, use the policy formula (|SU]) for an agent born at time r = to 
get 

c(0, 0) = (5 + tt) [a(0, 0) + /i(0, 0) + 6(0, 0)] 

and, assuming that a(0, 0) = 0; people inherit neither capital stock nor debt when 
they are born, we see that c(0, 0) = c*(0, 0) if and only if 

&(0,0) = -^c*(0,0)-/i(0,0). (53) 

+ TT 

Equation fl50l) also gives a similar formula for the present value of transfer flows to 
the consumers born at r < 0, 

K-r. 0) = r^c*(r, 0) - /i(r, 0) - a(r, 0) (54) 

where a(r, 0) is the total assets (capital stock and government debt) of a consumer 
born at time r as of time 0. 

Finally, the government will face the same decentralization problem for the gen- 
erations that will be born at r > and equation fl50p gives again the formula 

h{T,r) = -^c*{T,r)-h{T,T). (55) 
d + vr 

The right hand sides of fl53p . (1541) and (1551) are known to the government whereas 
the left hand sides are related to the transfers flow (3 through the integral formula 
fl5T|) . Any lump sum transfers flow /3 satisfying fl53|) . fl5^ and fl55|) decentralizes the 
desired plan and hence, unlike the capital income taxation, the lump sum taxation is 
indeterminate. 

Notice that the flscal policy that decentralizes the plan c* is time consistent: even 
if the government is given the opportunity to revise the path of flscal instruments over 
time, it has no interest in doing so in the absence of new information because the 
problem of time inconsistency is sorted out at the ex ante stage of central planning. 

Evaluating equation (jUj) at the steady state of the economy gives \mit/oo''Pif) = 
ip = 6 — f'(k) > and therefore, the steady state tax rate becomes stationary and it 
is given by 

limri(t-n,t) = ^^(f'(k)-5-^^Y (56) 
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This result shows that it is required to tax (or subsidize) capital income in the 
long term and it is in stark contrast with representative infinitely lived agents models 
where the capital interest taxation is not an optimal instrument in the long term. 

The long term capital income tax rate (|56l) require to subsidize the old cohorts 
and tax the young cohorts. In fact, it can be shown that there exist a threshold age 
n that the government subsidizes (resp. taxes) capital income for cohorts older (resp. 
younger) than h years. For instance, if the planner decide to decentralize the l.r.p. 
allocation converging to k with f'{k) = pf^, the cutoff age is given by 

n = — - m 



5 \6 + Ti 

Notice that if the intra period planner decides to use the egalitarian allocation 
(^(n) = TT, there is no need for an age dependent capital income taxation. Under this 
assumption, the government will simply use the long term uniform capital income 
subsidy rate 

IT 6 — p 

V = — — 

pir + p 

and finance this subsidy by lump sum transfers and/or by issuing bonds. 

We must emphasize that if the government has access to a commitment technolo- 
gies, it will adopt an allocation consistent with the asymptotic discount rate of the 
discount function that is p. The result of Calvo and Obstfeld [Hj suggests then 
that long term capital income taxation is not required. Therefore, in our model, the 
capital income taxation is entirely driven by the time inconsistency friction faced by 
the government. 



6 Conclusion 

The central concern of this paper has been the attempt to define a methodology to an- 
alyze a capital accumulation dynamic game with non constant discount in continuous 
time and under the assumption of non commitment. The methodology only requires 
standard differential calculus techniques and the resulting equilibrium characteriza- 
tion is a generalization of the HJB equation. Using a special fixed point theorem, 
the central manifold theorem (Carr [Hj), we were able to prove existence of multiple 
equilibria for a specification of the discount function. We perceive the connection of 
the methodology with the central manifold theorem as a good news: In fact the cen- 
tral manifold theorem comes with an approximation method (See Theorem 2 of Carr 
[T7] ) that opens the door to computational policy experimentation. Taken overall, 
the method seems to be applicable to a broad set of time inconsistency problems. We 
consider an application to the dynamic allocation problem for a forward looking util- 
itarian government in an overlapping generations economy. When the social discount 
rate and the private discount rate are distinct, the optimal command is time incon- 
sistent and the government becomes strategic. We find that there are multiple steady 
states for the economy but that the (local) renegotiation-proofness requirement se- 
lects one of them. The strategic allocation can be implemented in a market economy 
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if the government deploys distortional capital income taxation. Our main point here 
is that the time inconsistency friction that arises from the planning problem creates 
by itself a role for capital income taxation. The analysis of our overlapping genera- 
tions model has been limited in several important respects. In particular, we did not 
impose any restriction on the fiscal tools, neither directly as in the Ramsey approach 
of optimal taxation nor indirectly as in the Mirrlees approach to optimal taxation. 
The non restrictiveness of the fiscal tools allowed us to achieve a separation between 
the allocation problem and the taxation problem. As a result, we shut down the time 
inconsistency generated by the interaction between the government and the private 
agents. An unresolved question that we plan to investigate in future work is to see 
how the preferences based time inconsistency interacts with the time inconsistency 
due instruments insufficiency. This can be done in the context of the Mirrleesian ap- 
proach to optimal taxation and the main difficulty is to model a game with multiple 
private agents and multiple successive governments. 



A Proof of Theorem 2 



A.l Preliminaries 

Before proceeding with the proof of the theorem, let us mention some facts about the 
flow /C defined by ( |T5i) . ( |T6l) . To make notations simpler, we will use /C (t, k) instead 
of /C (cr; t, k) when the omission of the dependency on a causes no ambiguity. 

Note first that, since equation (fT5|) is autonomous, i.e. the right-hand side does 
not depend explicitly on time, the solution which takes the value k at time coincides 
with the solution which takes the value /C [t, k) at time t > 0. This is the so-called 
semi-group property, which is stated precisely as follows 

}C{s,}C{t,k)) = IC{s + t,k) (57) 

Next, consider the linearized equation around a prescribed solution t ^ IC{t, k) 
of the nonlinear system (fT5l) . namely: 



= (/' (/C k)) - a' (/C (s, k))) k, is) (58) 

as 

This is a linear equation, so the fiow is linear. The value at time t of the solution 

which takes the value k at time is 7l{t)k, where the function TZ : R (0, oo) 
satisfies: 

^ = (r(/C(.,A:))-a'(/C(.,fc)))7^(^) (59) 

7^(0) = l. (60) 

From standard theory, it is well known that, if / and a are C'^, then /C is C''~^, 
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and: 

die {t, k) 

dk 
die {t, k) 



n{t) 

-nit) {f{k)-a{k)) 



dt 

Let us now turn to the actual proof of Theorem [2J 
A. 2 Necessary condition 

Given a Markov equihbrium strategy cr, we define the associated value function v{k) 
as in formula fll7p 

/>oo 

v{k):= h{t)u{a{IC{t,k)))dt (61) 







Differentiating with respect to k, we find that: 

f°° die 

v'{k)= I h{t)u' {a{lC{t,k)))a' {lC{t,k)) — {t,k)dt 







oo 



h{t)u' ((t(/C (t, k))) a' (/C (t, k)) n{t)dt 

Since a is an equilibrium strategy, the maximum of Pi [k, a, c) with respect to 
c must be attained at a{k). The function Pi itself is given by formula (IT^ . where 
ko{t) = fC {t, k) and where ki is defined by ( TTUj) and (ITT]) , so that kiit) = TZit) (cr(/c) — c). 
Substituting into f|T3l) . we get: 

Pi {k, cr, c) = u (c) — u (cr(A;)) 

/■oo 

+ / h{t) u' {a (/C (t, A;))) a' (/C (t, k)) TZ (t) ((t(A:) - c) 



Since m is concave and differentiable, the necessary and sufficient condition to 
maximize Pi{k, cr, c) with respect to c is 

/■oo 

u'(c) = / h{t)u' (a(/C (t, k))) a' [K, (t, k))n{t)dt 



which is precisely v'{t,k), as we just saw. Therefore, the equilibrium strategy must 
satisfy 

u' {a{k)) =v'{k) 
and, substituting back into equation ( 1611) . we get equation (jlEjl . 

A. 3 Sufficient condition 

Assume now that there exists a function v satisfying (IIEI) . and consider the strategy 
a = i o v'. Given any consumption choice c & R, the payoff to the decision-maker at 
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time is 



Pi {k, a,c) = u (c) — u {(j{k)) 

POD 

+ h{t) u' [a (/C (t, k))) a' (/C (t, k)) n (t) {a{k) - c) dt 



= u{c)-u {a{k)) + v\k) {a {k) - c) 
= u{c)-u {a{k)) - u' {a {k)) (c - a (A;)) 
<0, 

where the first equahty follows from the definition of 7^, the second equality is ob- 
tained by differentiating v with respect to /c, the third equality follows from the 
definition of cr, and the last inequality is due to the concavity of u. Observing that 
Pi{k,a,a (k)) = 0, we see that the inequality Pi{k,a,c) < implies that c = a (k) 
achieves the maximum so that a is an equilibrium strategy. 

B Proof of Proposition [3] 

Let a function v : R ^ R he given. Consider the function<yj : R ^ R defined by 

POO 

ip{k)=v{k)~ h{t)u{a{}C{a;t,k)))dt (62) 



where cr (fc) = i {y' {k)). Consider ip (t, k), the value of ip along the trajectory t 
K, (cr; t, k) originating from k at time 0, that is 

V^(t,A;) = <^(/C {t,k)) 

POO 

= v{K,{t,k))- / h{s)u{a{lC{s,K,{t,k))))ds 







oo 



V (/C (t, k)) - / h{s)u ((t(/C (s + t, k))) ds 
Jo 

POO 

v{}C{t,k))- / h{s-t)u{ai}Cis,k)))ds 



where we have used formula ( 1571) . 

We compute the derivative of this function with respect to t: 

^ (fc, t) = v' (/C (t, k)) [f (/C (t, k)) - z (a (/C (t, k)))] 

POO 

+ u{a{lC{t,k)))+ / h'{s-t)u{a{lC{s,k)))ds 



Prom the definition of z, we have 
u {i {v' (/C (t, k)))) - v' (/C (t, k)) i {v' (/C {t, k))) = sup {u (c) - v' (/C (t, A:)) c} 
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Substituting in the preceding equation, and recalling that a = i ov' gives 
^ {k, t) = sup {u (c) + v' (/C {t, k) if (/C (t, k)) - c))} + [ h'{s - t)u {a{IC (s, k))) ds 

(^t c Jt 

POO 

= sup {u (c) + v' {K, (t, k) (/ (/C (t, A;)) - c))} + / ((t(/C (s, /C (t, A;)))) ds 



where the second equality is obtained by using a change of variable and formula (1571) . 
If (fTOll holds, then the right-hand side of the last equation is identically zero along 
the trajectory, so that ip {k, t) = ip {k) does not depend on t. Letting t — > oo in the 
definition of ip, we get: 

^{k)= lim \v{JC{t,k))- [ h{s)u{a{JC{s + t,k)))ds 

t >oo 







oo 



= v{k) - h{s)u {a {k))) ds = v{k) -u (/ {k)) / h (s) ds 
Jo Jo 

and hence, if fl20|) holds then, ip = ip = and equation fllEl) holds. 

Conversely, if v {k) satisfies equation fllEl) . then the same lines of reasoning shows 
that equation f|T9|) and the boundary condition are satisfied. 



C Proof or Proposition [4 

The proof uses the following: 

Lemma 8 Let a (k) be any convergent Markov strategy. Denote its steady state by 
k. Let ho : [0, oo] — > R be any function with exponential decay at infinity, that 
is 

ho{t) < Ce-''' 

for some positive constants z/ > and C > 0. A function I satisfies 



I' {k) if {k) - a (k)) + h'o {t) In {a {}C {a; t,k)))dt + ho {0) In {a {k)) = (63) 

Jo 

for all k and, the boundary condition 

POO 

I{k)= / ho{t)\nf{k) (64) 

if and only if it satisfies 

POO 

I{k)= I ho{t)\n{a{lC{a]t,k)))dt. (65) 



Proof. We argue as in the preceding proof. For any function /, consider the 
function ip {k, t) defined by 

/oo 
ho{s — t) In (o"(/C (cr; s, k))) ds. 
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Differentiating with respect to t, we get 

^ = r (/C (a; t, k)) if (/C (a; t,k)-cr (/C (a; t, k)))) 



+ J h'ois - t) In ((t(/C {a; s, k))) ds + (0) In {a{K, {a; t, k))) . 
Using a change of variable and equation (1571) . notice that 

^ (a; k,t) = I (/C {a; t, k)) - I ho{s) In ((t(/C (a; s, /C (a; t, A;)))) ds. 



Therefore, if ( 165|) holds, then ■?/' is identically zero, and so is its derivative so 
( l63l) holds. Conversely, if (l63l) holds, then ^ vanishes, and ip {k, t) = i/j (k) does not 
depend on t, so that 

if) {k) = lim < / (/C (cr; t. A;)) — / /;.o(s) In (cr(/C (a; s, /C (cr; t, /;:)))) (is 



/■CO 

I{k)- / /io(s)ln(a(fc))dt 



If in addition, (16^ holds then (l65l) holds. ■ 

Now let us turn to the the proof of Proposition HI To simplify the notations, we 
shall write h {t) in the following way 

h (t) = ^e-(^+")* + (1 - ^) e-P' 

where 

e= . i-e 



71 + 6 — p' 71 + 6 — p 

Suppose that a is an equilibrium strategy converging to k. Proposition [3] shows 
that the value v defined by fl37|) satisfies f|T9l) . fl20|) . After performing the optimization 
in c and substituting u{c) = ln(c), the right hand side of f|T9l) becomes the left hand 
side of f l39l) . Using the definitions of f and w given by fl37|) and fl38l) . it can be checked 
that the left hand side of f[T^ coincides with the linear combination of v and w given 
in the right hand side of fl5Ul) . Therefore, we proved equation f l5^ . The boundary 
condition (HTj) is proved by integrating (137|) after replacing the consumption flow 
cr (/C((t; t. A;)) by the steady state consumption level / (k). 

To get equation (HOj) . we use the preceding Lemma with cr (k) = i{v'{k)) = 
1/v' (k), hQ{t) = (1 — 6') (e^''* — e~(^"'"'^)*) and we get an equation for I{k) = w{k) 
given by 



w' {k) (^f {k) - j = - (1 - 0) ((5 + vr) e-(^+-)* - pe-''*) 7i (a (/C (a; t. A;))) rft 

(66) 

and a boundary condition 



w 



(k) = a-0)(-- j^) In / (k) = In / (k) . 

\p d + 71 J p{d + 7r) 
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The right hand side of (166!) can be written as the hnear combination — vrf (A;) + (p + 
Tr)w{k) and thus, we proved equation (l40l) . 

Conversely, suppose vi and wi satisfy the equations (l39l) and (HOl) . together with 
the boundary conditions (HTl) and (H2ll . with the strategy ai = l/v[ converging to k, 
so that 

Consider the functions: 

POD 

V2{k)= (^e-(^+")* + (l-^)e-''*)ln((Ti(/C(ai;t,A;)))dt (68) 
Jo 

POO 

W2{k) = {l-e) (-e-(^+")* + e-''*)ln((Ti(/C((Ti;t,A:)))dt (69) 
Jo 

Applying the preceding Lemma with I = V2 and I = W2 successively, we have 



v'2 {k) (/ - ai) + {d{8 + 7r)e-(^+-)* + (1 - ^) pe-''*) In (^i (/C (fXi; t. A;))) dt + In (^i (A;)) = 

^ (fc) (/ - (yi) + {l-e) (-e-(^+-)* + e-P') In (ai (/C (ai; t. A;))) rft = 
^2 (A:) = (l-^^) (i-^) ln/(fc) 

and hence 

'"'2 {k) if - ai) + In (0-1 (A;)) = 6v2 - {6 - p)w2 
^2 (/ - (Ti) = -7rf2 + (p + 7r)u;2 

Substracting (1701) from (1671) . and setting w = t>i — t>2, w = Wi — W2, we get: 



/ - ^) t'' = '5^2 - (5 - p)w2 



/ - ^ j = -7rt;2 + (p + 7r)u72 



(^)=0 



Obviously v = w = Q is a, solution. In the next Lemma, we show that it is the 
only one, so Vi = V2 and Wi = W2. Equation (168|) then becomes: 



POO 

{k)= / (0e-(^+")* + (l-^)e-''*)ln(cTi(/C(ai;t,A;)))rft 
Jo 



which is precisely equation (IE) for h (t) = {Oe ('^+'^)* + (1 — 0) e and u (c) = Inc. 
Since vi satisfies (IE), the strategy ai is then an equilibrium strategy. 



33 



Lemma 9 If {v,w) is a pair of functions, which are continuous on a neighbourhood 



f2 of k, continously differentiable for k ^ k, and which solve {71_) for k ^ k, then 
V = and w = 

Proof. Set f {k) — l/v[ [k) = {k). Note that (p{k) —>■ when k —>■ k. Since vi is 
C^, in fact C^, and v[ (k^ ^ 0, the value (p {k) changes signs when k crosses k. The 
system can be rewritten as: 

({>v' \ _ ( 5 —{5 — p) \ ( V 



(pw' I \ — TT p + TT J \ ^ 

The characteristic equation of the matrix on the right-hand side is: 

X^-{p + 6 + n)X + p{6 + 7i) = 

and the roots are X = p and X = 6 + tt. Changing basis, we can rewrite the system 
as: 

'pV P \fv 

ipW J \0 S + n J \W 



where V and W are suitable linear combinations of v and w. The solutions are 



V (k) = Ci exp (p [ -^dl] , W{k) = C2 exp ({5 + tt) / -^dl 

\ Jko fin J V Ao f ( 



'fco ^i^; / \ Jko 

where Ci, C2 and k^ are constants. Since l/ip>{k) ±00 when k ^ k, and both 
signs occur, one for k < k and the other for k > k, the only way we can get V (k) = 
W{k) =0 is by setting Ci = C2 = 0. ■ 



D Proof of Theorem 5 



By Proposition HI it is enough to show that the system fl39l) - P0l) with boundary 
conditions fj4Tl) - P2l) . has a solution {v,w) where v and w are required to be and 
such that the resulting capital stock 

dk 1 

has the property that lim^^oo k{t) = k when k^ is sufficiently close to k. 

We will give the proof in several steps. First, we make a sequence of change of 
variables and change of coordinates that simplify the system fl5I?l) - fHni) . Second, we 
apply the Central Manifold Theorem (e.g. [T7j) to show existence. Third, we conclude 
the proof by demonstrating the estimate of the steady states k given in Theorem [51 

D.l Change of variables 

Let us begin with the following useful lemma which proof is omitted. 
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Lemma 10 The equation x — In (1 + x) = /i has no solution for yU < 0. For fi = 0, 

the solution is x = 0. For > 0, there are two solutions Xi (fi) and X2 (/i) with the 
following properties: 

(a) Xi is decreasing, Xi (0) = 0, and Xi [fi) — > —1 when fi 1 — > 00 

(b) X2 is increasing, Xi (0) = 0, and X2 (/i) — > 00 when /i 1 — > 00 

(c) both Xi and X2 are continuous on [0, 00] and C°° on (0, 00) 

Note, that neither Xi nor X2 are differentiable at 0. In fact, if we replace the 
function a; — In (1 + x) = /i by its Taylor expansion near x = 0, we find that the 
equation x — In (1 + x) = /i is replaced by the equation x^ = fi, so that Xi (/i) and 
X2 (/i) are approximated by —-y/JI and ^/JI respectively when ju > is small. 

Introducing now the function defined by 

fi [k) := 6v {k) -{6- p)w {k) - In / [k) (72) 

equation (l39l) becomes 

fv' — 1 — In fv' = fi 
and, using the notation of Lemma [lOl this equation becomes 

f{k)v'{k) = l + Xi{fi{k)), 1 = 1,2 (73) 

Next, differentiating equation (j72ll gives 

f^^' = f (^6v' {k) -{6- p)w' - 

and, using equation fl4UI) to eliminate w' from the above equation yields 

fp' = 6il + Xip))-f' + ^^^[-pin + 6)v + {7r + p)p + {n + p)\nf] (74) 

after replacing fv' by 1 + X (/i). 

Our new system is now fl73l) . fl7^ where k is the independent variable, the function 
/ (k) is given, the functions Xi (/i) and X2 (p) are defined in Lemma [THl and the 
unknown functions are v (k) and p{k). The right-hand sides of the equations ( 1731) . 
(17^ are function of {k, v, p), which are defined and C°° for /c > and p > 0. 

Suppose now we have a solution {v,p), with 

p (k) = 0. (75) 

Then X (^p (A;)) = 0, so that 

v' {k) = 1/f (fc) 
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by equation (173|) . and the right-hand side of equation (17^ blows up unless 

^ ^ p (d) + vr) 

which is precisely our boundary condition fj4Tl) . If both fHTl) and fl75|) hold, the second 
boundary condition P2|) is automatically implied by fl72l) . 
Let us rewrite our new system (omitting the index i = 1.2) 

/(fc)^ = l + X(p) 

/W^ = 5(l + X(p))-/' + i±^[-p(7r + 5)t; + (vr + p)p + (7r + p)ln/] 

with the boundary conditions (HTjl and (!75|) . 

We now turn to a change of coordinate by taking X (p (A;)) = x as the independent 
variable instead of k along the trajectory. In other words, instead of looking for 
functions v (k) and p (A;) satisfying the equations (1751) . (ITll) . we will be looking for 
functions v (x) and A; (x) satisfying 

(it; (iv (i/c 1 + X dk 
dx dk dx f dx^ 

= f^TT^ \ - p (tt + 5) i; + (tt + p) p + (vr + p) In / 

Plugging in the relations p = x — In (1 + x), so that ^ = x (1 + x)~^, yields the 
new system 

dx ^ ^ ^ + ^ D x^k^v 



D ix,k,v 



where the function D is given by 

D (x, k, v) ={1 + x) [-p{7r + 6)v + {tt + S + p) x] 

(1 + x) (tt + 6) [In / (A;) - In (1 + x)] - xf (k) . (7^ 



We pick a point k and we look for solutions ^A; (x) ,v (x)jof the system (1761) . 
( 1771) . defined in a neigbourhood of x = and satisfying: 

^ (0) = A; (79) 
v{0) = ^^^lnf{k):=v (80) 
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We now introduce a new variable s, and we replace the system (1761) . (1771) with the 
following autonomous system 

dx 

— = D{x,k,v) , a;(0) = (81) 

dV dV dx 2 /g\ - /ggX 

ds dx ds ' 

where there are now three unknown functions {x (s) ,k{s),v{s)), defined near s = 0. 
Note that D is as smooth as /' in a neighbourhood of (O, k, v) and that D (O, k, v) = 0. 

Our problem is then to analyze the system (18T|) . (!82|) and (183!) since it is related to 
to the initial system (l39|) . fH0|) through a sequence of smooth change of variables and 
coordinates. 

D.2 Existence 

The linearized system near (O, k, v) is: 



^4) 









V 




'I] 









with (all derivatives to be computed at (O, A;, f)): 

/ 6~f'{k) (^ + P)j^ -p{n + S) \ 



A :-- 



dP 


dD 


dD 


dx 


dk 


dv 























V / 



The matrix A has the eigenvalues (5 — /' (k) ,0,0), and can obviously be put 
in diagonal form (in this case, as a matrix with 1 in the upper left corner, all the 
other coefficients being 0). To make the change of variables explicit, we note that the 
eigenvector associated with the eigenvalue 6 — f (fc) is 




and we set Ei := Span {e}. We also consider the kernel of A, and denote it by Eq: 

Eo:=KerA= A; j | (5 - f {k)) x + {n + p)^j^k - p{7r + 5)v 

Both El and Eq are invariant subspaces of the linearized system (18^ with the 
corresponding eigenvalues being Ai := 5 — /' (k) ^ and 0, and the operator A 
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is diagonal in any base (61,62,63) with 61 G Ei and_(62, 63) G E3. By the central 
manifold theorem (see for instance [17], Theorem there exists a map h{k,v), 
defined in a neighborhood (9 of (k, such that 

and the manifold Ai defined by: 

M := {{ak + (3v + h{k,v) ,k,v) \ {k,v) e O} 



is invariant by the flow associated with the equations (IHTl) . ( I82l) . ( l83l) . The map /i 
and the central manifold are as smooth as /': they are C^, for instance, if / is 
C^. Note that is two-dimensional and tangent to Eq at (0,k,v). Note also that 
there is another invariant manifold A^, which is tangent to Ei at (O, k, v) : it is one- 
dimensional, and it is stable if Ai = 5 — /' (k) < and unstable if Ai = 5 — /' (k) > 0. 
Each of these invariant manifolds gives a different type of solution to the system 
equations (IHTD, (|82|), (|83l). 

We are interested in the solutions which lie on the central manifold A4. They can 
be found by substituting x = ak + Pv + h {k, v) in equations (182!) and (!83|) . yielding 

f^j^^j^h±Jt±Mh^^ k{0) = k (85) 
ds 1 + ak + Pv + h {k,v)' 

^ = [ak + f3v + h{k,v)f , v{0)=v (86) 

while X is found by using the fact that Ai is invariant 

X (s) = ak (s) + l3v (s) + h{k (s) ,v (s)) . 
Eliminating the variable s from (185|) and (!86l) . we get 

di; fik) 



dk 1 + ak + Pv + h {k,v)' 



[k) 



The solution of this initial- value problem is v {k) = ip (k), where ip (fc) = v and 
ip is if is C^, that is, if / is C^(see above). Substituting in x = h {k, v), we get 
X (k) = ak + Pv + h {k, ip {k)). Finally, n{k) = x {k) — In (1 + x {k)) is also C°°, so we 
have found a smooth solution of equations (173|) and fl74|) . as desired. 

Differentiating equations (!39l) and evaluating it at A; = fc, taking into account that 
/ (fc) f' (A;) =1, yields the following 

w' (k) = (6 - f (k)) 



^^Theorem 1 in Carr [T7| requires that the linearized matrix at the fixed point be block diagonal. 
The matrix A is not block diagonal but the theorem can still be applied because the matrix A can 
be transformed into a block diagonal matrix with some appropriate change of basis. 
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D.3 Proving the estimate 

It remains to prove that the strategy cr {k) = 1/v' {k) is convergent. Recall that this 
means that the solutions of the equation 

^ = f(k)- 1/v' (k) , kiO) = ko 
at 

converge to k if ko i sufficiently close to k. Since / (k) = 1/v' (k), is a fixed point 
of the dynamical system, and we want to show that it is an attractor. This means 
that the linearized system at k, namely 



V [kY 



k 



must have k = k as an attractor. In other words, we must have 



v' [k) 



To compute the left-hand side of (JHTl) . differentiate equation (HOI) at set k = k. 
We get 

/' (k) + ^^-21 I w' {k) = -Tiv'ik) + (p + Tl)w'{k). 

v' (k) J 

To find w'{k), differentiate equation fl5Ul) and evaluate t at k = k, taking into 
account that / (fc) v' (fc) =1, to get 

yj' (k) = (S _ /' (k)) . 

^ ^ i6-p)f{k) ^ ' ^ 



Hence 



,'{ky) ^' f{m-p) 

and, switching w'{k) from the left hand side to the right hand side of the last equation 
gives 

^ ' V' (kf S-f'{k) 

This will be negative if the numerator and denominator have opposite signs. Both 
the numerator and the denominators of the right hand side of the last equation are 
increasing functions of k and they change sign respectively at f'{k) = and 
f'{k) = 6. Since p < vr, we have p^^ < 6 and therefore the only interval where 

/' {k) + < is the open interval / = (p^, 5^ 
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E Proof of Theorem [T] 



We will need a preliminary result, the proof of which is quite obvious. 
Lemma 11 Let f {x,y) be a function of two variables such that: 

f ix,x) = if (x) 

Then: 

df 

— = (f' (x) -^p{x) 

To prove that a is not l.r.p, we will establish that there exists e > such the 
inequality ( H6l) is not satisfied for all ko G {k — e,k + e) and k* e {k,k + e). We 
will now introduce a new notation of the equilibrium value as a function of the initial 
point ko, as before, and of the terminal point k. More precisely, we define 

/■oo 

V {ko, k) = h{t)u {a (/C {a; t, ko))) dt 
Jo 

where h (t) is given by fl5Bl) and a is an equilibrium strategy converging to k 
Assuming that V is differentiable with respect to k, we can apply the preceding 
Lemma, with 

and get 

which is positive on the whole allowable interval /. It follows that there exists e > 

for all ko, k in the interval ijz — e,k ^ e). 
^°The function V is defined for any fc* € / by 

V{ko,k*)^ / h{t)u{a* {IC((j*;t,kn)))dt 
Jo 

where a* is an equilibrium policy converging to k* . Of course, there may be multiple equilibria 
converging to k* and in order to define properly V{., .) we need to make an a priori selection of a 
converging equilibrium policy for each fc* G / 
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Now, consider any initial capital level /cq € {k — e.k + e) and steady state capital 
level k* e {k,k + e). By the mean value theorem, we have 

Viko, k*) - Viko, k) = {k* -k)^ {h, k) > 

for some k e {k, k*) and where the inequahty follows from the fact that ^ e — 
e,k-\- e). 
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